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PATENT APPLICATION: US/10/528,872 



DATE: 04/01/2005 
TIME: 10:57:13 



Input Set : A:\0185010SeqList.txt 
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3 


<110> 




5 


<120> 




7 


<130> 


c- -> 


9 


<140> 


c- -> 


10 


<141> 




12 


<150> 




13 


<151> 




15 


<160> 




17 


<170> 




19 


<210> 




20 


<211> 




21 


<212> 




22 


<213> 




25 


<220> 




26 


<221> 




27 


<222> 




29 


<220> 




30 


<221> 




31 


<222> 




33 


<220> 




34 


<221> 




35 


<222> 




37 


<220> 




38 


<221> 




39 


<222> 




41 


<220> 




42 


<221> 




43 


<222> 




45 


<220> 




46 


<221> 




47 


<222> 




49 


<220> 




50 


<221> 




51 


<222> 




53 


<220> 




54 


<221> 




55 


<222> 




57 


<220> 




58 


<221> 




59 


<222> 




61 


<220> 




62 


<221> 




63 


<222> 



APPLICANT: DSM IP ASSETS B.V. 
TITLE OF INVENTION: SQS gene 
FILE REFERENCE: NDR5218 

CURRENT APPLICATION NUMBER: US/10/528,872 
CURRENT FILING DATE: 2005-03-23 

PRIOR APPLICATION NUMBER: EP 02021619.8 

PRIOR FILING DATE: 2002-09-27 

NUMBER OF SEQ ID NOS : 8 

SOFTWARE: Patentln version 3.2 

SEQ ID NO: 1 

LENGTH: 4807 

TYPE: DNA 

ORGANISM: Phaffia rhodozyma 
FEATURE: 
NAME /KEY: 5 ? UTR 
LOCATION: (1469) . . (1470) 
FEATURE: 
NAME /KEY: exon 
LOCATION: (1550) . . (1577) 
FEATURE : 

NAME/KEY: Intron 
LOCATION: (1578) . . (1752) 
FEATURE: 
NAME/KEY: exon 
LOCATION: (1753) . . (1766) 
FEATURE: 

NAME/KEY: Intron 
LOCATION: (1767) . . (1882) 
FEATURE : 
NAME/KEY: exon 
LOCATION: (1883) . . (2071) 
FEATURE: 

NAME/KEY: Intron 
LOCATION: (2072) . . (2182) 
FEATURE: 
NAME /KEY: exon 
LOCATION: (2183) . . (2397) 
FEATURE : 

NAME /KEY: Intron 
LOCATION: (2398) . . (2474) 
FEATURE: 
NAME/KEY: exon 
LOCATION: (2475) . . (3087) 



0088 Not Comply 

'Neecfe' 
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65 <220> FEATURE: 

66 <221> NAME /KEY : Intron 

67 <222> LOCATION: (3088) . . (3230) 

69 <220> FEATURE: 

70 <221> NAME /KEY: exon 

71 <222> LOCATION: (3231) . . (3356) 

73 <220> FEATURE: 

74 <221> NAME/KEY: Intron 

75 <222> LOCATION: (3357) . . (3453) 

77 <22 0> FEATURE: 

78 <221> NAME /KEY: exon 

79 <222> LOCATION: (3454) . . (3475) 

81 <22 0> FEATURE: 

82 <221> NAME/KEY: Intron 

83 <222> LOCATION: (3476) . . (3564) 

85 <220> FEATURE: 

86 <221> NAME/KEY: exon 

87 <222> LOCATION: (3565) . . (3881) 

89 <220> FEATURE: 

90 <221> NAME/KEY: Intron 

91 <222> LOCATION 

93 <220> FEATURE: 

94 <221> NAME/KEY: exon 

95 <222> LOCATION: (3959) . 

97 <220> FEATURE: 

98 <221> NAME/KEY: polyA_s 

99 <222> LOCATION: (4106) . 

101 <400> SEQUENCE: 1 

102 gttcctgttc agtcaaagag 
104 ggtcagaaca tcggagatac 
106 gtgaggtttg cctaggaagt 
108 gatgaacgac atgtcgaacc 
110 ttccagcttt tctgctctct 
112 agtcgatgtt ctgtcgacag 
114 ggaggacagg gtggctttaa 
116 tcgaaggttg actcctcttg 
118 atttcctttt ctttctaccc 
120 ctgaccgatc cgaatatcta 
122 aggttcatgt ttgaaagcat 
124 agcaggctca atgatcactt 
126 gtcgagattc ttttttcttc 
128 gcgggggatc acccgcatat 
130 tcataggtga aggttaaaac 
132 cgacttgggc agcctcgtcc 
134 tggcgggttc gtcatggccg 
13 6 aatgacagtt tcccgacgcc 
13 8 actgaagaag gtagggtctc 
140 cttgatcaga cccgacttat 
142 ctgctctttc gtgtttttca 



(3882) . . (3958) 



. (3970) 
ite 

. (4107) 

tgggaaaaac 
aatggcccat 
aatcccttcg 
catctccatc 
ccagtttcgc 
gagaccagta 
caaatcggta 
ctatgtgtat 
ggagagtaag 
gcgcaggttg 
tgatcctagt 
ggggtttgtg 
ttttggtcga 
taagcggtat 
ggaatggata 
atagtgtctg 
tgatcatctg 
atcactaaga 
gtcgagccag 
gaatatggcc 
ctttctttcc 



atgaaagtaa 
agaggaagga 
tttctcaaag 
ctcgaaatca 
agctttctct 
gaaggcggaa 
gtacggagga 
gagagcatat 
acacacaaag 
cttctctact 
tgcctctatc 
catcttgatg 
gaaaaaaaaa 
gacgctcatc 
ggaggagcta 
atggttatat 
ctttgttaga 
cacaaacgta 
tgcaaccaga 
gttattgtac 
ggatcaaacg 



aaagatgtaa 
aagctactta 
atatcttttt 
agtttactcg 
tcgggaagaa 
ccgacaattt 
tcgaacggcg 
ccgttgatgt 
aatcacgaag 
ggttccattc 
tgaggccagt 
ttcaaccaag 
cggcttcgct 
aaccggccaa 
accacgtttt 
cgtcatagaa 
cattgtccat 
tccagcacgc 
gttacagatg 
acttcttggt 
agactgctcg 



tgaaagaagg 
ccagaaacca 
tgaaagcatc 
atttagacct 
gctctccgcc 
tggatggatc 
cttctctcgt 
ctcagttaaa 
aatatgatga 
ttcgaacgat 
ctgccaatgt 
tgtcgcaacg 
tcgcacgcgc 
gtgttcttca 
tattttaatt 
aggcagcgcc 
cagtcacctc 
catgtccatc 
aacatcaggc 
gctcctcgag 
tgtatctatc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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144 


tgtgcttgcc atatgagcat cccatgcctc tgctcaaatg 


atgctggagc tacgatccat 


1320 


146 


cagagacgac acaaaacggg gttgtatgaa ctctacattt 


cctaatgtta ttggaatttt 


1380 


148 


ctgtaatgcg ttcttcatct ttctctaatg cttttttgta gtccgtcttt tcaaccttgc 


1440 


150 


cagcgtttcg cgtgtcttct ttctcctttg acggtcatca 


ctttcttctc tettctegtt 


1500 


152 


ctttcttccg tccttccttc tctctcttcg tctgaacatc 


agcatcatc atg ggc ata 


1558 


153 






Met Gly He 




154 






1 




156 


tea 


gat tac etc gtt ctg g gtcagttctg tcttttgttt gattcttatc 


1607 


157 


Ser Asp Tyr Leu Val Leu 






158 




5 






160 


ttcttgeegg cggtcgcctg tcttgggtat atcatcagca 


atgagaaaca tgatgttccc 


1667 


162 


cccgcgtcaa tcactgacct tttggtcctc tacttctttc 


ctgtcgaatt gatcctgatt 


1727 


164 


gatacgtgtg ccggctgctt aacag ct ttc acg cat 


cct gtaggtgttt 


1776 


165 




i^a§aA \ Ala Phe Thr His 


Pro 




W--> 168 


tategtatge ttcatgttga tgtttagrca cgcggactga 


cctggccggt tgattttctg 


1836 


170 


tatgatcget tgtgctaccg tctttcttgg aaatccttcc 


catcag gee gat ctg 


1891 


171 






Ala Asp Leu 




172 






15 




174 


cga 


get tta atg cag tac gcg ate tgg cat gag 


cct cga agg aat ate 


1939 


175 


Arg 


Ala Leu Met Gin Tyr Ala lie Trp His Glu 


Pro Arg Arg Asn He 




176 




20 25 


30 




178 


act 


gca cag gag gaa cat gca aca tec ggt tgg 


gac cga gaa act atg 


1987 


179 


Thr 


Ala Gin Glu Glu His Ala Thr Ser Gly Trp 


Asp Arg Glu Thr Met 




180 




35 40 


45. 




182 


aag 


gaa tgt tgg aag tat ttg gat ctg act tea 


aga agt ttc gca get 


2035 


183 


Lys 


Glu Cys Trp Lys Tyr Leu Asp Leu Thr Ser 


Arg Ser Phe Ala Ala 




184 


50 


55 60 


65 




186 


gtc 


ate aaa gag ttg gac gga gat ctt acc cga 


gtc gtacgtgttt 


2081 


187 


Val 


lie Lys Glu Leu Asp Gly Asp Leu Thr Arg 


Val 




188 




70 75 






190 


tcatcttctc tctcctttga gatctggtcg cctccgcatt 


ttcttgttgc agaagggtca 


2141 


192 


gaagctgaca acaccatctc tactgttcgg gaeaeggcta 


g ate tgt tta ttc tat 


2197 


193 






He Cys Leu Phe Tyr 




194 






80 




196 


etc 


get ctt cga gga ctg gat acc att gag gat 


gac atg agt eta tct 


2245 


197 


Leu 


Ala Leu Arg Gly Leu Asp Thr lie Glu Asp 


Asp Met Ser Leu Ser 




198 




85 90 


95 




200 


aat 


gat gtg aag ctt ccc ctg ctt egg aca ttc 


tgg gaa aag ctt gac 


2293 


201 


Asn 


Asp Val Lys Leu Pro Leu Leu Arg Thr Phe 


Trp Glu Lys Leu Asp 




202 




100 105 


110 




204 


tec 


cct ggg tgg acc ttt act gga tec ggt cca 


aat gag aag gat aga 


2341 


205 


Ser 


Pro Gly Trp Thr Phe Thr Gly Ser Gly Pro 


Asn Glu Lys Asp Arg 




206 


115 


120 125 


130 




208 


gag 


ctt ctt gtt cac ttc gat gtg gee ate gee 


gag ttt gee aac ttg 


2389 


209 


Glu 


Leu Leu Val His Phe Asp Val Ala He Ala 


Glu Phe Ala Asn Leu 




210 




135 140 


145 




212 


gac 


gtc aa gtgagtttcc ctttatggtt ggatcatccg ctcgacagac 


2437 


213 


Asp 


Val Asn 






216 


tegaaacget catcactttg gtctgcttga tgaacag c tct egg aac gtc att 


2490 
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217 




















Ser Arg Asn Val He 




218 




















150 












220 


cga gac 


ate 


act 


cgc 


aag 


atg 


ggt 


aac 


ggt 


atg 


gee 


gac 


ttt 


get 


tct 


2538 


221 


Arg Asp 


He 


Thr 


Arg 


Lys 


Met 


Gly 


Asn 


Gly 


Met 


Ala 


Asp 


Phe 


Ala 


Ser 




222 


155 








160 










165 










170 




224 


etc tct 


acg 


ccc 


tec 


aag 


cct 


gtg 


gee 


gag 


gtc 


cag 


teg 


acc 


gaa 


gat 


2586 


225 


Leu Ser 


Thr 


Pro 


Ser 


Lys 


Pro 


Val 


Ala 


Glu 


Val 


Gin 


Ser 


Thr 


Glu 


Asp 




226 








175 










180 










185 






228 


ttc aac 


eta 


tac 


tgt 


cat 


tac 


gtc 


get 


gga 


etc 


gtc 


ggc 


gag 


gga 


etc 


2634 


229 


Phe Asn 


Leu Tyr 


Cys 


His 


Tyr 


Val 


Ala 


Gly 


Leu 


Val 


Gly 


Glu 


Gly 


Leu 




230 






190 










195 










200 








232 


tec cga 


etc 


ttt 


gtc 


gcg 


acc 


gag 


aag 


gaa 


cga 


cca 


ttc 


ttg 


gee 


aac 


2682 


233 


Ser Arg 


Leu 


Phe 


Val 


Ala 


Thr 


Glu 


Lys 


Glu 


Arg 


Pro 


Phe 


Leu 


Ala 


Asn 




234 




205 










210 










215 










236 


cag atg 


gta 


ctt 


tea 


aac 


teg 


ttc 


gga 


etc 


ctt 


etc 


caa 


aag 


aca 


aac 


2730 


237 


Gin Met 


Val 


Leu 


Ser 


Asn 


Ser 


Phe 


Gly 


Leu 


Leu 


Leu 


Gin 


Lys 


Thr 


Asn 




238 


220 










225 










230 












240 


ate ctt 


cga 


gat 


att 


egg 


gag 


gac 


gee 


gac 


gaa 


ggt 


cgt 


ggc 


ttc 


tgg 


2778 


241 


lie Leu 


Arg Asp 


He 


Arg 


Glu 


Asp 


Ala 


Asp 


Glu 


Gly 


Arg 


Gly 


Phe 


Trp 




242 


235 








240 










245 










250 




244 


cca aga 


gag 


ate 


tgg 


gee 


aac 


ccg 


ate 


tat 


act 


gcg 


cat 


gca 


ccg 


ggc 


2826 


245 


Pro Arg 


Glu 


He 


Trp 


Ala 


Asn 


Pro 


He 


Tyr 


Thr 


Ala 


His 


Ala 


Pro 


Gly 




246 








255 










260 










265 






248 


aca agg 


ttt 


aac 


teg 


ttg 


act 


gac 


ctg 


gtc 


aag 


aaa 


gaa 


aac 


ate 


gac 


2874 


249 


Thr Arg 


Phe 


Asn 


Ser 


Leu 


Thr 


Asp 


Leu 


Val 


Lys 


Lys 


Glu 


Asn 


He 


Asp 




250 






270 










275 










280 








252 


aaa gga 


tea 


atg 


tgg 


gtg 


ttg 


agt 


gcg 


atg 


aca 


etc 


gac 


gcg 


ate 


acc 


2922 


253 


Lys Gly 


Ser 


Met 


Trp 


Val 


Leu 


Ser 


Ala 


Met 


Thr 


Leu 


Asp 


Ala 


He 


Thr 




254 




285 










290 










295 










256 


cat act 


acc 


gac 


gca 


ctg 


gac 


tac 


etc 


tea 


ctt 


eta 


aag 


aac 


cag 


agt 


2970 


257 


His Thr 


Thr Asp 


Ala 


Leu 


Asp 


Tyr 


Leu 


Ser 


Leu 


Leu 


Lys 


Asn 


Gin 


Ser 




258 


300 










305 










310 












260 


gtt ttc 


aac 


ttt 


tgt 


get 


ate 


ccg 


get 


gtc 


atg 


teg 


att 


gca 


acg 


ttg 


3018 


261 


Val Phe 


Asn 


Phe 


Cys 


Ala 


He 


Pro 


Ala 


Val 


Met 


Ser 


He 


Ala 


Thr 


Leu 




262 


315 








320 










325 










330 




264 


gag eta 


tgc 


ttc 


atg 


aac 


cca 


gcg 


gtg 


ttc 


caa 


cga 


aac 


ata 


aaa 


ate 


3066 


265 


Glu Leu 


Cys 


Phe 


Met 


Asn 


Pro 


Ala 


Val 


Phe 


Gin 


Arg 


Asn 


He 


Lys 


He 




266 








335 










340 










345 






268 


aga aag 


gga 


gaa 


gee 


gtc 


gag 


gtgcgttcgc gcgttctgtt tctacctttc 


3117 


269 


Arg Lys 


Gly Glu 


Ala 


Val 


Glu 






















270 






350 




























272 


ataacattgg aggttcttga etcttaageg tcttccaatc 


tgatgcctcc aattatcatc 


3177 


274 


atttttgtct tttttgcttt cctcttgttt etttteggeg 


tgattcaatc cag etc 


3233 


275 




























Leu 




278 


att atg 


aag 


tgc 


aac 


aac 


cct 


egg 


gag 


gtg 


gca 


tac 


atg 


ttt 


aga 


gat 


3281 


279 


He Met 


Lys 


Cys Asn Asn 


Pro 


Arg 


Glu 


Val 


Ala 


Tyr Met 


Phe Arg Asp 




280 


355 








360 










365 










370 




282 


tat get 


cga 


aag 


att 


cat 


gee 


aag 


get 


att 


cct 


aca 


gat 


cct 


aac 


ttc 


3329 


283 


Tyr Ala Arg Lys 


He 


His 


Ala 


Lys 


Ala 


He 


Pro 


Thr Asp 


Pro 


Asn 


Phe 
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284 375 380 385 

286 ate aag ttg age gtt gcg tgt ggt cga gtgagttgat cgatcgatcc 3376 

287 lie Lys Leu Ser Val Ala Cys Gly Arg 

288 390 395 

290 atcttttgtt ttgatcatcg cgagacttga ctgatcgatt actcaaaaca tcatcgcttc 3436 

292 tccttcttgc tctctag ate gaa caa tgg get gag cac t gtatgttcct 3485 

293 He Glu Gin Trp Ala Glu His 

294 400 

296 ccgcccctcc ttcaagtttc ctctcgcttc atctttgttg agaagaggga tctgatgtat 3545 

298 ctttctttgt tcggatcag ac tac ccc tea ttt atg atg att egg cct teg 3596 

299 Tyr Tyr Pro Ser Phe Met Met He Arg Pro Ser 

300 405 410 

302 aat gac cct caa aac ccc gca ccc tea acg gcg ctt gac cct ttc tea 3644 

303 Asn Asp Pro Gin Asn Pro Ala Pro Ser Thr Ala Leu Asp Pro Phe Ser 

304 415 420 425 

306 gga gac get cgt tta agg ata gec tct aag aag get gag ate acc gee 3692 

307 Gly Asp Ala Arg Leu Arg He Ala Ser Lys Lys Ala Glu He Thr Ala 

308 430 435 440 445 

310 get get ctt gtc agg aag aaa gee egg gat cac get aag tgg aga gag 3 740 

311 Ala Ala Leu Val Arg Lys Lys Ala Arg Asp His Ala Lys Trp Arg Glu 

312 450 455 460 

314 tec aag gga ttg cct ccg age gat ccg aac aag ccg gac aac teg gag 3 788 

315 Ser Lys Gly Leu Pro Pro Ser Asp Pro Asn Lys Pro Asp Asn Ser Glu 

316 465 470 475 

318 gat gtt aat tgg gta ttg ate ggc ggt atg ate gtt gga ttg ttg etc 3 836 

319 Asp Val Asn Trp Val Leu He Gly Gly Met He Val Gly Leu Leu Leu 

320 480 485 490 

322 gtg atg ggc gtg etc ggt ttg get ate get tgg gtt gtt ctt cag 3881 

323 Val Met Gly Val Leu Gly Leu Ala He Ala Trp Val Val Leu Gin 

324 495 500 505 

326 gtgcgttctt ccaaagagcc tttctctcat gaacacgcac ataggttgat ctaattctat 3 941 
328 cttactctgt catacag ttt gag caa taa tctcaagatt ctagtccatc 3990 
32 9 Phe Glu Gin 

330 510 

332 ctttcgctca aegatctget tcttctcctt ctccttctcc gtcttctctg gtttcttttc 4050 
334 ttactttctg ggatcttcct tcttgaatcc tccgatccaa tgtaatctgc ataccctcgc 4110 
336 tttagtagaa accgatcctt cattcgatct tggcgaaaat ctaagcaaag agaatcactt 4170 
338 ttgtctaata aaatttcctt taaagagtcg gctttttctt gtggcgaagc ttcatcccgt 4230 
340 cttcctctgg accatctctt ctcaatattc tttgtgctac tatatgatca agttctttga 4290 
342 aatcaaagaa gaacatgtat ttgattttga ggttccaaga atacaacegg cccaagtcgt 4350 
344 tettegcagt tttcatcaga cagcacatat ctctcctcct ctctatagaa gccgtatggg 4410 
346 gccaatcgac tctcatgggt agaccgtgcc cttttgacac ggggagaaag agaacgaaag 4470 
348 gacacttgac egattegtta ataaagcegt ccccaccttt tctttaatgg caattcaaga 4530 
350 agagaaaaac aacccctgcg cgcactcgag tagtcgatca gaccttccga acgacagata 4590 
352 teatttgetg aaatcgaccg gattttaaag ctgctgccag gtcggtgaat ccccctaggt 4650 
354 gatctccttg tacaaagatg ttgggcacgg acttttcgac ceggatgaga acgtcgtgaa 4710 
356 gagtttgaaa aagattatca acataatgtg tctttttttc ttttttcttt cgtaactctc 4770 
358 tagagaacga ggagaegtae ggtctgattt gttatcg 4807 
361 <210> SEQ ID NO: 2 
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Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:4; N Pos. 3,6,12,15 
Seq#:5; N Pos. 3,9,15,18,21 
Seq#:6; N Pos . 3,6,9,12,15 

Invalid <213> Response; 

Use of "Artificial" only as "<213> Organism" response is incomplete, 

per 1.823(b) of New Sequence Rules. Valid response is Artificial Sequence. 

Seq#:4,5,6,7,8 
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PATENT APPLICATION: US/10/528,872 
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Input Set : A:\0185010SeqList.txt 
Output Set: N:\CRF4\04012005\J528872.raw 



L:9 M:270 C: Current Application Number differs, Replaced Current Application Number 

L:10 M:271 C: Current Filing Date differs, Replaced Current Filing Date 

L:168 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 

L:671 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4 after pos . : 0 

L:709 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:5 after pos . : 0 

L:747 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 6 after pos . : 0 
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